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Abstract 

We consider the problem of channel estimation for amplify-and-forward (AF) two-way relay networks 
(TWRNs). Most works on this problem focus on pilot-based approaches which impose a significant 
triiining overhead that reduces the spectral efficiency of the system. To avoid such losses, this work 
proposes blind channel estimation algorithms for AF TWRNs that employ constant-modulus (CM) 
signaling. Our main algorithm is based on the deterministic maximum likelihood (DML) approach. 
Assuming M-PSK modulation, we show that the resulting estimator is consistent and approaches the true 
channel with high probability at high SNR for modulation orders higher than 2. For BPSK, however, 
the DML performs poorly and we propose an alternative algorithm that performs much better by taking 
into account the BPSK structure of the data symbols. For comparative purposes, we also investigate the 
Gaussian maximum-likelihood (GML) approach which treats the data symbols as Gaussian-distributed 
nuisance parameters. We derive the Cramer-Rao bound and use Monte-Carlo simulations to investigate the 
mean squared error (MSB) performance of the proposed algorithms. We also compare the symbol-error 
rate (SER) performance of the DML algorithm with that of the training-based least-squares (LS) algorithm 
and demonstrate that the DML offers a superior tradeoff between accuracy and spectral efficiency. 
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I. Introduction 

Amplify-and-forward (AF) two-way relay networks (TWRNs) |[T| have recently emerged as a spectrally 
efficient approach for bidirectional communication between two terminals. The majority of works on 
TWRNs assume the availability of channel knowledge at the terminals, which makes it important to 
develop efficient channel estimation algorithms. Although some recent works have considered noncoherent 
two-way relaying which avoids channel estimation by employing differential modulation schemes (c.f. Q, 
|[3|), the resulting systems have lower performance than those employing coherent schemes. 

So far, researchers have taken a training-based approach to channel estimation for AF TWRNs, 
estimating the channels using pilot symbols known to both terminals. In [4], the training-based maximum- 
likelihood (ML) estimator was developed for the classical single-antenna single-relay system assuming 
flat-fading channels. In [5], the ML approach was used to acquire initial channel estimates at the relay 
that were employed to denoise the training signal. Power was then allocated at the relay to optimize 
channel estimation at the terminals. The case when the relay and the two terminals are equipped with 
multiple antennas was considered in f^], where initial estimates of the multi-input multi-output (MIMO) 
channels were acquired at each terminal using the training-based least-squares (LS) algorithm, and then 
a tensor-based approach was employed to improve the estimation accuracy by exploiting the structure 
of the channel matrices. Channel estimation for TWRNs in frequency selective environments has also 
been considered in a number of works. LS estimation for OFDM systems was considered in ||7|. A 
nuUing-based LS algorithm was proposed in J8| to jointly estimate the channel and the carrier-frequency 
offset (CFO). Finally, channel estimation and the design of training symbols for TWRNs operating in 
time- varying environments has been considered in |9|, where a complex-exponential basis expansion 
model (CE-BEM) was used to facilitate LS channel estimation. 

The obvious shortcoming of the pilot-based approach is the training overhead which in turn reduces the 
spectral efficiency of TWRNs. Blind channel estimation would avoid this burden, which makes it a more 
spectrally-efficient approach. Little effort, however, has so far been dedicated to develop blind channel 
estimation algorithms for TWRNs. In |10|, a blind channel estimator was proposed for OFDM-based 
TWRNs by using the second-order statistics of the received signal and applying linear precoding at both 



terminals, while in 1 1 1 1 a semi-blind approach to joint channel estimation and detection for MIMO- 
OFDM TWRNs was developed using the expectation conditional maximization (ECM) algorithm. 

In this work, we consider blind channel estimation for TWRNs that employ constant-modulus (CM) 
signaling. Because of its constant envelope, CM signaling permits the use of inexpensive and energy- 
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efficient nonlinear amplifiers. In fact, CM signaling in the form of continuous-phase modulation (CPM) 
is employed in the well-known GSM cellular standard, while 8-PSK modulation is employed in the 



Enhanced Data Rates for GSM Evolution (EDGE) 1 12|. Moreover, QPSK modulation is supported in the 
3rd Generation Partnership Program (3GPP) Long Term Evolution (LTE) and LTE-Advanced wireless 
standards |13j|. CM signaUng has also been employed in a number of works on amplify-and-forward 
relay networks |[14|-|[T7|. In fact, a novel relaying protocol has been recently proposed in |18| whereby 
CM signalling is employed both at the terminals and at the relay which only forwards the phase of the 
received signal to the destination. This protocol is referred to as phase-only forward relaying. 

Assuming nonreciprocal flat-fading channels, we propose a deterministic ML (DML)-based algorithm 
that estimates the channel blindly by treating the data symbols as deterministic unknowns. While the 
proposed algorithm may be applied to any type of CM signaUng, we analyze its asymptotic performance 
assuming that the terminals employ M-PSK modulation. Noting that consistency is not guaranteed for ML 



estimators when the data symbols are treated as deterministic unknowns 1 19 1, |20|, we prove that our DML 
estimator is consistent when the channel parameters belong to compact sets. We also study the asymptotic 
behavior of the DML estimator at high SNR and prove that it approaches the true channel with high 
probability for modulation orders higher than 2. For M = 2, however, the DML estimator performs poorly, 
and we propose an alternative estimator based on the constrained ML (CML) approach which provides 
much better performance by explicitly taking into account the BPSK structure of the data symbols. As 
a simple alternative to the DML approach, we also consider the Gaussian ML (GML) estimator which 
can be obtained by treating the data symbols as Gaussian-distributed nuisance parameters. When CM 
signaling is employed, the GML estimator takes the form of a sample average which is consistent and 
can be updated online but suffers from an error floor at high SNR. To evaluate the performance of our 
estimator, we also derive two Cramer-Rao bounds (CRBs) for our estimation problem. The first bound 
is obtained by treating the data symbols as deterministic unknown parameters and the second is the 
modified CRB |j2]J which treats the data symbols as random nuisance parameters. 

Monte Caiio simulations are used to investigate the performance of the proposed algorithms. For 
M > 2, we show that the DML estimator outperforms the GML estimator at medium-to-high SNR and 
approaches the CRB at high SNR. For M = 2 we show that the CML-inspired estimator outperforms 
the GML estimator except at very low SNR. We also investigate the tradeoffs of following the blind 
approach by comparing the symbol-error rate (SER) performance of the DML estimator with that of 
the training-based LS estimator. We show that the DML approach provides a better tradeoff between 
accuracy and spectral efficiency. 
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The remainder of the paper is organized as follows. In Section [TTj we present our system model. In 



Section III we present the proposed algorithms. In Section IV we analyze the asymptotic behavior of our 



estimators and derive the CRB. We show our simulation results and comparisons in Section |V] Finally, 



our conclusions are presented in Section VI 
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Fig. 1. The two-way relay network with two source terminals and one relay node. 

II. System Model 

We consider the typical half-duplex TWRN with two source nodes, 7i and Ti, and a single relaying 
node TZ, shown in Fig. [T] The network operates in quasi-static flat-fading channel conditions. Each data 
transmission period is divided into two phases. In the first phase, 71 and T2 simultaneously transmit to 
TZ the M-PSK data symbols ti and t2, respectively. The symbols ti and t2 are of the form ti = ^/Ple^'^^ 
and t2 = ^/^e•?'^^ where Pi and P2 are the transmission powers of 71, Ti, respectively, (pi and (f>2 are the 
information-bearing phases randomly and independently chosen from the set Sm = {(2^ — 1)11 /M,i = 
1,...,M}, and ^ 
by 



1. The received signal at the relay during the first transmission phase is given 

r = hih + git2 + n, (1) 



where hi and gi are the complex coefficients of the flat-fading channels 71 — )• 7^ and T2 ^ TZ, 
respectively, and n is zero-mean complex additive white Gaussian noise (AWGN) with variance a^. 
In the second phase, TZ broadcasts the amplified signal Ar, where A is the amplification factor, assumed 
to be known at both terminals as is common practice (cf. Q, ||5|). The amplified signal passes through 
the channels /i2 and g2 to reach 71 and T2, respectively. The complex channel coefficients hi, /i2, gi and 
g2 remain fixed during the estimation period. To maintain an average power of P,. at the relay over the 
long term, the amplification factor is chosen as Q 
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Pr 



P1 + P2 + ol 
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(2) 
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Without loss of generality, we consider channel estimation at node Ti- The received signal at 71 in the 
second transmission phase is 

z = Ahih2ti + Agih2t2 + Ah2n + r/, (3) 

where r] is the zero-mean AWGN term at 71 with variance cj|. We assume that the noise variances o"^ 
and (T^ are unknown at 71- The term Ahih2ti represents the self-interference at 71- In Q, there are 6 
unknown real channel parameters: the magnitudes and phases of the complex parameters hi, h2, and gi. 
However, not all of them are identifiable. By inspecting Q, we see that the channel parameters which 
can be identified (blindly or otherwise) when the noise variances are unknown are[^ |/ii||/i2|> 1 51 1 1 ^2 1 > 
Z/ii + Z/i2, and Zgi + Z^2- On the other hand, it is not possible to identify the individual magnitudes 
|/ii|,|/i2| and \gi\, or individual phases Zhi, Z/i2 and Zgi. In other words, the identifiable channel 
parameters are a = /11/12 and 6 = 51/12 ■ Under the CM assumption, it is sufficient for detection purposes 
to know a and (pi, = Zb. 

III. Proposed Channel Estimation Algorithms 

Estimation is performed at 71 using N received samples, Zi, i = 1, N, of the form given by ([3]). 
The time index i, i = I, . . . , N, is used to indicate the realizations of ti, t2, (pi, 4)2, n, rj, that gave rise to 
each sample Zi. Lej^ z = [zi, . . . , zat]^ be the vector of received samples. This vector can be expanded 
as 

z = Aati + Abt2 + Ah2n + r/, (4) 
where ti = [tn, . . . ,tiNf , t2 = [t2i, ■ ■ ■ ,t2Nf , n= [ni, . . . , hat]^ and rj = [rn, . . . ,r]N]'^ . 

A. Deterministic ML Approach 

To avoid dealing with a complicated likelihood function, we treat the transmitted symbols t2i,i = 
1, . . . , N as deterministic unknowns. We also ignore the finite alphabet constraint that restricts the phases 



(j)2i, i = 1 . . . , N to the set Sm- Nonetheless, the actual statistics of these phases will be used in Section IV 
to analyze the behavior of the resulting estimator. Due to the above assumptions, the DML approach can 
be used to blindly estimate the sums (phi — 4'2i + 4'bi'>' = '^t ■ ■ i but it cannot be used to obtain separate 
estimates of (ph and cl)2i, i = I . . . , N. However, since M-PSK modulation is assumed, the Viterbi-Viterbi 

'in this paper, | ■ | and Z(-) denote the magnitude and phase of a complex number, respectively. 

^In this paper, (■)^ and (•)* are used to denote the transpose and complex conjugate operations, respectively. 
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algorithm {22} may be used to blindly estimate 0^. A few pilots, however, are still needed to resolve the 
resulting M-fold ambiguity in (p^. 

We define the vector of unknown parameters for the DML algorithm as = [a, \ b\, . . . , 4>bN-, ^o]^' 
where a"^ = yl^|/i2p(T^ + is the overall noise variance at 71- With t2 assumed deterministic and ti 
known, the received vector z is complex Gaussian with IE{z} = Aati +Abt2 and covariance ¥,{zz^} = 
a^I. Hence, the log-likelihood function is given by 

C{z; 6) = — \\\z - Aati - Afttsf - A^log (vra^) 

N (5) 

= - A £ - - Ab\VP2e''^'" ' - iVlog (vra^) . 

i=i 

Let d, \h\, a1, be the ML estimates of a, \b\, and i = 1, . . . , A^, respectively. It is straightforward 
to show that (pu = ^{zi — Aatu), which when substituted into C{z\ 6) yields the updated log-Ukelihood 

N / \ 2 



(6) 



Maximizing ([6]) with respect to we get 



\b\ = — , ^ • (7) 

' ' NA^ 

Substituting |6| in place of |6| in (|6]l, we obtain 

1 ^ / 1 ^ V 

C"{z-a,(jl) = :^^[\z^- Aatu\- —^\zk- Aatik\\ - iV log (vtct^) . (8) 

^° i=i \ k=i / 

Hence, the ML estimate of a is given b}|^ 

( N / ^ N \2. 

a = arg min < M^j - Autii\ - ^ kfc - Autik\ >. (9) 

Finally, the ML estimate of the parameter al is neither needed for detection nor for estimating a, \h\, 

^We note that a slightly different estimator may be obtained by using a likelihood function that also takes into account the 
pilot symbols used to resolve the ambiguity in (f)^. Since the required number of pilots is very small, the resulting estimator will 
only be slightly better than ([9}. 
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but is provided for completeness and is given by 

" (10) 



i=l \ k=l ) 



We note that the parameter \h\v& also not needed for detection when CM signaling is used. 

The ML estimator in (|9]) has an intuitive interpretation. Let Zi{u) = Zi — Autu^i = 1, ... be 
the "cleaned" versions of the received signal samples after self-interference has been removed using 
the complex value u as an estimate of a. The signals Zi{u),i = 1, . . . , N are independently-generated 
realizations of the RV z{u) = A{a — u)ti + Abt2 + Ah2n + rj. The quantity 

TV / TV \2 

i=l \ k=l J 

which is the objective function in ([9]), scaled by ]y~l' ^1^° represents the sample-variance of the envelope 



of z{u). We demonstrate in Section IV that the value u = a which completely cancels the interference 
also minimizes the variance of the envelope of z(n). Hence, the variance of the envelope of z{u) may 
be seen as a measure of the level of self-interference. 

By treating the transmitted symbols t2i,i = I, ■ ■ ■ , N as deterministic unknowns, the estimator in (|9]l 
ignores the underlying structure of the phases 02i ,i = 1, . . . , N. A valid question is whether a better blind 
estimator may be obtained by explicitly taking into account the structure of the phases. This question is 
particularly relevant for BPSK modulation, for which we show in Section IV that the DML estimator in (|9]l 



performs poorly as it experiences infinitely many global minima at high SNR. There are two alternative 
blind ML approaches which take into account the phase structure of the data. The first approach is to 
perform ML estimation using the likelihood function f{z \ a,b,a1) which can be obtained by treating 
t2i,i = I, . . . ,N as random parameters and marginalizing over them. Unfortunately, this approach is 
very complicated due to the mixed-Gaussian nature of f{z \ a,b,a^). The other alternative is to follow 
a constrained ML (CML) approach by solving the likelihood function in ([5]) subject to the constraint 
that (/)2i £ SM,i = 1,...,A^. For higher order modulations, it is difficult to apply this approach and 
still obtain a closed-form objective function. However, as we shall see next, this approach becomes more 
feasible for BPSK modulation. 

B. Constrained ML Approach for BPSK 

As we will see shortly, a straightforward application of the CML approach for M = 2 is not feasible 
as it results in a 3-dimensional search. However, it is possible to obtain a CML-inspired estimator which 
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possesses a closed-form objective function by utilizing just a few pilot symbols to estimate the phase (f>h. 

Under BPSK modulation, the CML approach minimizes the same log-likelihood function as in ([5]), 
but subject to the constraint that t2i = ztv^, i = I, . . . , N. We focus on the first term in (|5]) since the 
estimation of is decoupled from the estimation of the remaining parameters. Let 



N 



i=l 



(12) 



be our objective function. The resulting estimate of t2i,i = I, ■ ■ ■ , N, is 



hi = V^2Sgn ( ?Si{e-^'f'^{zi - Aatu)} ) . 



(13) 



Substituting ( [13] ) back into ([12]), we obtain 



N 



N 



J'{z; a, \b\,c^b) = ^\zi- Aatu\^ + NA^\b\^P2 - 2A\b\^2Y,me-'^'{z^ - Aatu)}\. (14) 



i=l 



i=l 



From ( 14 1, we see that the CML estimate of |6| is given by 



1^1. 



(15) 



1=1 



Substituting ([15) into ( [14] ), we obtain the updated objective function 



JV /TV Y 

J"{z; a, <Pi,) = Y,\^i- - ]7 E me-''^'{zi - Aatu)}\ 

i=l \i=l J 



(16) 



which we have to solve for a and (j)^. The CML estimate of cpb in terms of a is the solution of the 
following maximization problem 

AT 



arg max } \zi — AatuW cos(Z(zi — Aatu) 

V'e[0,27r) ^ 
1=1 



(17) 



Unfortunately, the maximization in ( [T7[ ) does not have a closed-form solution, which means that we 
cannot proceed to obtain an objective function that only depends on a, like the one in (|9]). Thus, a strict 
application of the CML approach requires the use of a 3-dimensional search to estimate a and (pb. 



To get around this problem, we propose to replace (pb in ( [16] ) with a pilot-based estimate of and 
then proceed to estimate a. We let xij, X2j and Sj, j = 1, . . . , J, be the J pilot symbols transmitted 
by Ti and Ti and the corresponding samples received at 71, respectively. We may thus estimate cpb by 
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bf, = Z {sj — Aaxij)x2j- Substituting (pb into ([16]), we obtain the following estimate of a 



N 



he = arg min y \zi — Autu 



i=l 



N 











\k=l 





-jZ Yl {Sj-AuXij)x2 



(18) 



We refer to the estimator in ( 1 8 1 as the modified CML (MCML) estimator. 



C. Gaussian ML Approach 

In deriving the DML and the MCML estimators, we treated the data symbols t2i,i = 1,...,N as 
deterministic unknowns. Another approach commonly used to deal with nuisance parameters is Gaussian 



ML estimation |23 1. In this case, the data symbols t2i,i = 1, . . . ,N are treated as i.i.d. complex Gaussian 
random variables with mean zero and variance P2. Under this assumption, the total noise variance becomes 



<7g = + o-Q. The resulting log- likelihood function on a is 



£{z; a) 

Hence, the GML estimate of a is 



AaU 



N 
i=l 



iVlog(7ra2). 



(19) 



1 ^ 



9 N NAPi 



(20) 



i=l 



Thus, the GML estimate of a has the form of a computationally inexpensive sample-average. It can be 
easily updated at run-time by updating the average term as new samples arrive. The GML estimator will 
serve as a low-complexity benchmark with which to compare the performance of the proposed DML and 
MCML estimators. 



IV. Asymptotic Behavior Analysis 

In this section, we analyze theoretically the asymptotic behavior of the proposed estimators. Since the 
parameters |6| and a"^ are not required for detection, our analysis focuses on the estimation of a. Regarding 
the DML estimator, the fact that we treat the phases = I, . . . , N as deterministic unknowns means 
that the number of real unknown parameters for (complex) samples is Because the dimension of 

the parameter space grows linearly with the number of samples, the DML estimator falls within a special 
class of ML estimators that are based on "partially-consistent observations" 1 19] . The estimated parameters 
can be classified into two groups. The first group are the incidental parameters (pbiji = 1, . . . , N. Each 
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incidental parameter affects only a finite number of samples (a single sample in our case). The other 
group of parameters, a, \b\, and a^, affect all received samples and are thus called structural parameters. 
The estimation of structural parameters in the presence of incidental parameters has been investigated in 



the literature |19|, |20|, |24|, |25| and is referred to as the Incidental Parameter Problem. It is well-known 



that the asymptotic properties of ML estimators, such as consistency, which hold when the dimension of 



the parameter space is fixed do not necessarily hold in the presence of incidental parameters 1 19|. It thus 
becomes important to investigate the asymptotic behavior of the DML estimator. We do this by explicitly 
taking into account the fact that (pu and (j)2i,i = 1, . . . ,N are equiprobably and independently chosen 
from the set Sm- Regarding the powers Pi, P2 and Pr, the most common convention in TWRNs is to 
set Pi = P2 = Pr (cf. 0, |[7|, p6[), even though some works use slightly different setups. For instance, 
the authors in p]| choose P2 = 2Pi and Pr = 5(^1 + ^2)- In our analysis we consider the general case 
of Pi = aP2 and Pr = (3{Pi + P2) for some a, /? > 0. 

The first asymptotic property we address is the consistency of the DML estimator, which we prove to 
hold when the parameter spaces of a and h are restricted to compact sets. This is true even for BPSK 
modulation. The second aspect we address is how the DML estimator behaves at high SNR (we will 
define SNR shortly). We show the estimator approaches the true channel with high probability at high 
SNR for M > 2, while it encounters infinitely-many global minima at high SNR for M = 2. 

We also analyze the high SNR behavior of the MCML estimator, showing that it approaches the true 
channel with high probability for J = 1 (single pilot). For J > 1, the pilots can be chosen such that the 
MCML estimator always approaches the true channel at high SNR. 

A. Consistency of the DML Estimator 

In this section, we study the behavior of the DML estimator as — 00. We demonstrate that the 
DML estimator is consistent when the channel parameters a, h belong to compact sets. This holds for 
any M > 2 and for any Pi, P2, (^r^^t- 

Before proceeding, we note that the estimator of a in (|9]) belongs to the class of extremum estimators. 



An estimator is called an extremum estimator (cf. |27|, |28|) if there is an objective function T.n{uj 



such that id = arg minllAr(w), where ^7 is the set of parameter values. The fundamental theorem for 
the consistency of extremum estimators can be summarized as follows: 



Theoem 1 (Newey and McFadden p8| i Ch. 36, Theorem 2.1]). If lo belongs to a compact set Vl and 
Sjv(i^) converges uniformly in probability to SoC"^) as N ^ 00, where So(a;) is continuous and uniquely 
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minimized at 00 = ojo, then Co converges in probability to Uq. 



Thus, we need to establish that, as — cxd, the objective function Wn{u) in ( |TT| ) converges unifonnly 
in probabiUty to a deterministic function of u which has a unique global minimum at u = a. Letting 

V = a — u, and Vn{v) = WN{a — v), we obtain 

N / N \ ^ 

1=1 \ k=l / 

where yi{v) = Zi{a — v). The terms yi{v), i = 1, . . . , N are independently-generated realizations of the 
RV y{v) = z{a — v), and Vn{v) is the sample variance of the envelope of y{v). Let V{v) be the variance 
of |y(v)|, (fk — and 9kiv) = (py — cpb + for k = 0, . . . , M — 1. We show in Appendix [a| that 

V(^;) = ^2H'Pi+A2|6|2p2 + a,2- fA/^''£'L./.(-AfeW)') , (22) 

\ k=0 ) 



where ^i/2(-) is the Laguerre polynomial |29| with parameter 1/2, and 

\k(v) = ^ (^>|2pi+^2|j|2p2 + 2A>||6|/p^cos(0fc(7;))) . (23) 

The behavior of V{v) for v = 0, which corresponds to u = a, is described in the following lemma: 

Lemma 1. The variance V{v) of the random variable \y{v)\ has a unique global minimum occurring at 
v = 0. 

Proof: See Appendix [B] ■ 
To apply Theorem 1, it remains to establish that Vn{v) converges uniformly in probability to V{v). 
Since Vj\[{v) is the sample variance of \y{v)\ and \y{v)\ has a finite fourth central moment, Vn{v) 



converges in probability to V{v) |30|. The following lemma holds regarding uniform convergence in 



probability which is a stricter requirement than convergence in probability: 

Lemma 2. Suppose a, b and v all belong to compact sets, then Vm{v) converges uniformly in probability 
to V{v). 

Proof: See Appendix |C] ■ 
From Lemma [T] and Lemma |2j we see that all of the conditions of Theorem 1 are met if the channel 
parameters a and b belong to compact sets. Hence, the following theorem holds. 

Theorem 2. If the channel parameters a, b belong to the compact sets Bi and B2, then the following 
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estimator of a 



IS consistent. 



a = art 



( N / ^ N \2. 

mm I ^ \Azi - Autii\ - ^ '^'^ ~ ^utik]] |- 



The compactness assumption for a and h can be satisfied by assuming that the magnitudes of gi, 
hi and /i2 are bounded. If we treat gi, hi and /i2 as complex Gaussian random variables, there is no 
upper bound on \a\ and strictly speaking, but we can always choose a sufficiently large ^ such that 
Prob{\a\, \h\ = 1 — e, where e can be made arbitrarily small. 

B. High SNR Behavior of the DML Estimator 

We now investigate the behavior of the DML estimator at high transmit SNR. To simplify our 
presentation, we assume that = = a"^. This assumption allows for a simple and intuitive definition 
of SNR as 7 = ^. Let 

N / ^ N \ 2 

^(^) = l^^^i* + ^^*2i| -J^Yl l^^^i^ + ^^*2fc| (24) 

i=l \ k=l J 

be the objective funcion in (|9]l in the limit as o" — )• 0. The following lemma describes the behavior of the 
DML estimator as tr — 0. 

Lemma 3. For fixed (finite) N, the DML estimator approaches the true channel a as a ^ 0, except when 
the data symbols are such that the phase differences (pu — (j)2i , i = 1, . . . , N take at most two distinct 

values, in which case the objective function encounters infinitely many global minima as a ^ 0. Therefore, 

/ o \ N—1 

assuming M-PSK transmission, a ^ a as a ^ with probability Pm.n = 1 — (^-^j (M — 1). 

Proof: See Appendix [D] ■ 
As we can see from Lemma [3} the DML estimator approaches the true channel except in the event 
that the phase differences (pu — (j)2i, i = 1, . . . ,N happen to take at most two distinct values. When 
this event occurs, the objective function of the estimator will have infinitely many global minima at high 
SNR. In fact, the occurence of this event also results in a singular Fisher information matrix as we shall 



see in Section IV-E For M > 2, this event is highly unlikely for sufficiently large N . For instance, at 
M = 4, the probability of this event is less than 5.6 x 10^^ for N > 30. Thus, the DML estimator 
approaches the true channel with high probability for M > 2 as long as the sample size is not extremely 
small. In this case, the average MSE performance of the estimator keeps improving with SNR, and it 
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can effectively achieve arbitrary accuracy for sufficiently high SNR. For M = 2, however, P2.N = 
and the estimator always encounters infinitely-many global minima at high SNR because the difference 
(f'li ~ 4'2i cannot take more than two distinct values for i = 1, . . . ,N. In fact, for M = 2 and for any 
non-zero v, b, there are only two possible values that the terms {vtu + bt2i\, i = 1, . . . , N can take, and 
they are \y/Piv + \/^b\ and \^/PlV — ^/B2b\. Wheneveij^ti _L b, these two values are equal, which means 
that the terms {vtu + bt2i\, i = 1, . . . , N are all equal regardless of the values of tu, t2i, i = 1, . . . , N. 
Therefore, all the values of v such that v -L b are global minimizers of the objective function X{v), 
and the estimator is not able to identify the true channel. Hence, the DML estimator performs poorly 
for M = 2 as its MSB performance degrades at high SNR. Despite this behavior, the DML estimator is 
consistent for M = 2 for any fixed a > 0, because for o" > only v = minimizes the variance V{v). 
This explains why, as we shall see, the DML estimator performs better at low SNR than high SNR for 
M = 2. Since the performance of the estimator will also degrade for very low SNR, the MSB for M = 2 
exhibits a U-shaped behavior when plotted versus SNR. This is confirmed by our simulation results in 
Section |Vl 

We note that similar high SNR behavior is encountered if the noise variances o"^ and are assumed 
to be different (but fixed) and the transmit power P2 becomes asymptotically high. 



C. High SNR Behavior of the MCML Estimator (M-2) 

We now investigate the high SNR behavior of the MCML estimator and show that by taking into 
account the BPSK structure of the data symbols it effectively avoids the problem of infinitely many 
global minima at high SNR. Let 

N ^ /TV \2 

Y{v) = \Avtu + Abt2i\'^ ~n\Y^ I^^*!*^ + ^^*2fc| • I cos(Afe)| (25) 

t=l \k=l ) 



be the objective function in ( fTS] ) as cr 0, expressed in terms oi v = a — u, where 

J 

Afc = Z(Avtifc + Abt2k) - AJAbP2 + AvY^ xijX2j). (26) 

i=i 

Clearly, Y(v) > and Y{0) = 0. It remains to investigate whether it is possible to have Y{v) = for 
V 0, i.e., whether there could be other global minima besides v = 0. For this to happen, the terms 
\Avtik + Abt2k\, k = 1, . . . ,N have to be all equal and the terms | cos(Afc)|, k = I, . . . , N have to be all 

''The notation a; _L y means that Zx — Zy = ^ mod vr. 
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equal to 1. The first requirement can be met if u _L 6 or if the product iijt2i is the same for i = 1, . . . , A^. 
As for the second requirement, it is only satisfied when 

J 

^{Avtik + Abt2k) = ^{JAbP2 + Av'Y^ xijX2j) mod n (27) 



for k = 1, . . . ,N, where mod is the modulo operator. Let V. be the event that the requirement in p7| ) 
is satisified for k = 1,...,N. For J = 1, H occurs with probability P{n) = 1/2^. For J > 1, 
however, the occurrence of T-L can be completely avoided if the pilots are chosen such that the products 
xijX2j, i = 1,..., J are not all equal. If the pilots are randomly chosen, then P('H) = 1/2^+"'^^. 
Hence, depending on the choice of the pilots, the MCML estimator either always approaches the true 
channel or approaches it with a probability of 1 — 1/2^+"^^^. Thus, the MCML estimator effectively 
resolves the problem of infinitely many global minima at high SNR even when a single pilot is used. 

D. MSE Performance of the GML Estimator 



The estimator of a in ( [20| ) can be expanded as 

N 1 ^ 1 ^ 

1=1 1=1 1=1 

It is straightforward to check that the estimator is unbiased. The resulting MSE is 

Since E{|dg — ap} — )• as — )• oo, the estimator is consistent. Clearly, as a — )• 0, the MSE performance 

|6|2 

of this estimator is limited by an error floor of 
E. Cramer-Rao Bounds 

In this section, we obtain CRBs for the estimation of a and \b\. The first bound is the standard CRB 
derived by treating the data symbols ^21, • • • ,t2N as deterministic unknowns. We exclude the parameter 
from our CRB derivation since its Fisher information is decoupled from the Fisher information of the 
other parameters. Let Or = [K{a}, 9{a}, . . . , (j)hNV be the vector of real unknown parameters 

(excluding a1), and let I{6r) be the corresponding Fisher information matrix (FIM). The matrix I{9r) 
is given by 

Hi H2 
H2 Hs 



(30) 
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where 



and 



2A^NPi 
2A^NPi 



2A^^{e^^H^t2} 
2A^^{e^'i'H^t2} 
2A^NP2 



H 



2 - ^ 



2A^^{b*tiitl-^} ... 2A^'^{b*tiNt*2j^} 
2A^^{b*tnt*2^} ... 2A'^^{b*tiNt*2N} 
... 



Hs = ^2A^\b\^P2lN. 



The resulting CRBs on the estimation of a and |6| ar( 



CRBa = [i"-'(0ij)]ll + [I-'(0fl)]22, 



(31) 



(32) 



(33) 



(34) 



and 



(35) 



Moreover, using the Schur-complement property and letting H. = i3"i — H2H^ ^H2, we obtaiij^ 

CRBa = [H'\i + [H~\2, (36) 



and 



CRBi 



\b\ 



J33- 



(37) 



The bounds CRBa and CRB^ exist whenever H is invertible. It can be shown that det(^) = 
whenever the data symbols are such that the differences — 4)21, i = 1, . . . , N take at most two distinct 
values, (recall that this is the same condition which results in infinitely many global minima at high SNR). 
Hence, the bounds do not exist for BPSK modulation since this condition is always met for M = 2. 

The bounds CRBa and CRB\i,\ are both functions of ti and t2, and thus they hold for the particular 
realizations of ti and t2 under consideration. In Section |Vj we use Monte-Carlo simulations to average 
CRBa and CRB^^ over many realizations of ti and t2. 



^The notation [A]ij is used to refer to thie (i,j)tli element of the matrix A. 

^Although! it is possible to evaluate the expressions in \36\ and |37} and obtain closed-form expressions for CRBa amd 
Ci?-B|i,|, the resulting expressions are quite lengthy, and are omitted for the sake of brevity. 
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Another variant of the CRB, commonly used in the presence of random nuisance parameters is the 



Modified CRB (MCRB) |21|. In deriving the MCRB, we compute a modified FIM (MFIM) by first 



extracting the submatrix of the conventional FIM which corresponds only to the nonrandom parameters 
(3?{a}, 9{a} and and then obtaining the expectation of this submatrix with respect to the random 
nuisance parameters (t2 in our case). Let 6' = [5}?{a}, 9{a}, the MFIM is 



Im{e') ^ E{H,} 



A^NPx 
2J^NPx 
2A^NP2 



(38) 



Hence, 



MCRBa 



and MCRB 



2A'^NP2 ' 



(39) 



The bounds in ( 39 1 have the advantage of possessing a simple closed form. However, they are not as tight 



as the standard CRB and for this reason we will only use the bounds in ( [36| ) and pJ} in our simulation 
results. 



V. Simulation Results 

In this section, we use Monte-Carlo simulations to numerically investigate the performance of the 
proposed algorithms. Our results are obtained assuming Pr = Pi = P2, and they are averaged using 
a set of 300 independent realizations of the channel parameters hi, /i2, gi and §2- These realizations 
are generated by treating hi and h2 as correlated complex Gaussian random variables with mean zero, 
variance 1 a correlation coefficient g = 0.3, and also treating gi and g2 as correlated complex Gaussian 
random variables with the same mean, variance and correlation coefficient, but independent of hi and /i2. 
To generate correlated complex Gaussian random variables we follow the approach proposed in [31 1. The 
ML and MCML estimates are obtained using a two-dimensional grid-search with a step-size of 10^'^. As 
before, the SNR is defined as 7 = Unless otherwise mentioned, MSE results are for the estimation 
of a. 

We begin by comparing the MSE performance of the DML, GML and MCML estimators for M = 2. 
We do not show the CRB in this case, since the FIM is singular. The MSE performance of the three 
estimators is plotted versus SNR for = 45 in Fig. |2] and versus A^ for an SNR of 20 dB in Fig. [3] For 
the MCML estimator, 2 pilots are employed to obtain an estimate of Both plots show that the DML 
estimator performs poorly for BPSK modulation and is outperformed by the GML and MCML estimators. 
Fig. [2] shows the U-shaped behavior of the DML estimator for M = 2 described in Section IV-B 
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The MCML estimator is superior to the GML estimator except at very low SNR. Moreover, as SNR 
increases, the MCML estimator improves steadily while the GML estimator hits an error floor. Fig. [3] 
also demonstrates the superiority of the MCML estimator to the GML estimator. 

Next, we study the MSB performance of the DML and GML estimators for M = 4 (QPSK modulation). 
The MSB performance of the two estimators is plotted versus SNR for = 45 in Fig. |4] and versus A'^ 
for an SNR of 20 dB in Fig. [5] The bound CRBa is shown as a reference in both plots. As we can 
see in Fig. |4| the DML estimator outperforms the GML estimator, except at low SNR. Moreover, the 
MSB performance of the DML estimator improves steadily with SNR and approaches CRBa, while that 
of the GML estimator hits an error floor at high SNR. In Fig. |4| it is also noticed that the MSB of the 
GML estimator goes below CRBa at low SNR. This should not be a surprise since CRBa is derived 
by treating ±2 as deterministic, and the GML estimator is biased in this case. As we can see in Fig. [5j 
both estimators improve as N increases, but the DML estimator is much closer to CRBa- 

In Fig. [6| the MSB performance of the DML and the GML estimators for the estimation of |6| and the 
associated CRB are plotted versus SNR for M = 4. For the GML estimator, we obtain an estimate of 
|6| by substituting hg in (|7]l. Fig. |6] shows that the GML estimator is slightly better except at high SNR 
where it appears to hit an error floor and the DML estimator becomes better and approaches CRB\^. It 
must be noted that |6| plays no role in detection when M-PSK modulation is used. 

Our next goal is to compare the SBR performance of the DML estimator with that of the training- 
based LS estimator in order to investigate the tradeoffs between accuracy and spectral efficiency that 
result from following the blind approach. We focus on small sample sizes since this is more suitable 
for modem-day cellular systems. We note that, when channels are nonreciprocal, the training-based LS 
estimator is an efficient estimator which coincides with the training-based ML estimator. As a reference, 
we also plot the SBR performance under perfect channel knowledge. The phase is estimated blindly 
using the Viterbi-Viterbi algorithm^ and a small number of pilots is employed to resolve the resulting 
M-fold ambiguity using the unique word method |32|. In Fig. [TJ we show the SBR performance of the 



two algorithms assuming that the channel is fixed for the duration of 20 samples. For the DML estimator, 
2 pilots and 18 data symbols are transmitted. The 2 pilots are used to resolve the M-fold ambiguity, and 
all 20 samples are used to blindly estimate a. For the LS estimator, we estimate a and b using 4 pilot 

'Using the Viterbi-Viterbi algorithm, we estimate cjn by 4>b = jj^T-^^i |^i(a)peJ^^*'^"^+'', where ii{a),i = l,...,iV 
are the resulting A'^ samples after the estimate a is used to remove self-interference. 
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symbolq^Jand we transmit 16 data symbols. As we can see from Fig. [Vj the SER performance of the DML 
estimator is very close to that of the LS estimator (approximately 0.6 dB away). In Fig. [8} we assume 
that the channel is fixed for the duration of 40 samples. In the DML case 4 pilots and 36 data symbols 
are transmitted, and in the LS case 8 pilots and 32 data symbols are transmitted. The performance of 
DML estimator is again very close to that of the LS estimator (approximately 0.4 dB away), and it is 
only 1.5 dB away from the performance under perfect CSI. In both examples, for the DML algorithm 
we use 90% of the channel coherence time to transmit data and 10% to transmit pilots, while for the 
LS algorithm we are use 80% of the coherence time to transmit data and 20% to transmit pilots, which 
demonstrates that the DML estimator offers a better tradeoff between accuracy and spectral efficiency. 

It is worth noting that the channel coherence times considered in our SER studies are compatible with 
current wideband cellular systems. For instance, considering a wideband system with 256 carriers and 
a sampling frequency of lO^^s, the channel has to be invariant for 5.12ms in the first example and for 
10.24ms in the second example. Assuming a central frequency of 2.4GHz and a vehicle speed of lOm/s, 
the coherence time is around 12.5ms, which is larger than the required coherence time in both examples. 

VI. Conclusions 

In this work, we proposed the DML algorithm for blind channel estimation in AF TWRNs employing 
Af-PSK modulation. The DML estimator was derived by treating the data symbols as deterministic 
unknowns. For comparison, we derived the GML estimator by treating the data symbols as Gaussian- 
distributed nuisance parameters. We showed that the DML estimator is consistent. For M > 2, we showed 
that it approaches the true channel with high probability as SNR increases and that its MSE performance 
is superior to that of the GML estimator for medium-to-high SNR and approaches the derived CRB at 
high SNR. In contrast, the GML estimator suffers from an error floor at high SNR. We also compared 
the SER performance of the DML estimator with that of the training-based LS estimator and showed 
that the DML approach provides a better tradeoff between accuracy and spectral efficiency. For M = 2, 
however, we showed that the DML estimator performs poorly and proposed the MCML estimator which 
explicitly takes into account the structure of the BPSK signal. This estimator outperforms the GML 
estimator except at very low SNR and approaches the true channel at high SNR. 

^For the training-based LS algorithm, the pilots are chosen such that the pilot vectors from the two terminals are orthogonal 
to each other. 
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Appendix A 

In this appendix, we derive the closed-form expression for the variance V{v) of the RV |^/(^^)|. We 
recall that 

y{v) = Av^fPxe^'^^ + AftV^e^'^' + Ah^n + 77. (40) 
The variance of \y{y)\ is V{v) = E{|y(f)p} — E{|y(i')|}^. It can be easily shown that 

E{\y{v)\^} = A^\v\^Pi + A^\b\^P2 + al (41) 
To find a closed form for E we first obtain the conditional expectation E {|y(f )| | (t)i,<p2 }■ We 



can see from (40 1 that ^{y{v)} and conditioned on (j)i and (f)2, are Gaussian-distributed with 



conditional means 

E{K(y(y)) I (Pi,^2} = A\v\y/FiCOs{(l)^ + + A\b\y/P2COs{(t)b + <P2), 

and 

E{9(y(i;)) I (Ai,(/>2} = ^|z;|/f\sin((/.^ + </.i)+^|6|V^sin(0b + 02) 
and a conditional variance of Hence, when conditioned on (pi and (/)2, \y{v)\ is a noncentral Chi 



random variable whose mean is given by |33| 



lE{|yWI I <t>iA2} = \l-^L,„{-\{v-<Pi,^2)), 

where 

X[v;cPu<P2) = \[Emv{v)) |<Ai,<A2}' + E{9(y(^;)) \cP1A2?) 

I / X ^^^^ 

= ^ + ^2|6|2p2 + 2AM\h\^/F\P2COs{(t)y -ct>b + ^i- (t)2)) 

and Li/2(x) = e^/^ [(i - x)Iq{x/2) + xIi{x/2)] is the La guerre polynomial with parameter 1/2, and 
Ir{-) is the Modified Bessel Function of the First Kind of order r [29|. The last equation shows that 
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X{v; 01, (1)2) depends on the difference {(j)i—(j)2) mod 27r, i.e., \{v; cpi, (j)2) = X{v; (/>!— </>2). The difference, 
{(pi — 4>2) mod 27r takes the values (pk = /c = 0, . . . ,M — 1 with equal probabiUty. Let \k{v) = 
X{v]4>i — ct>2 = ^k), the unconditional mean E{|y(u)|} is given by 

M-l 



n\yiv)\}=Yl 



k=0 



4M2 



-^^1/2 (-Afc(v)) , 



(43) 



which completes the proof of (22 1. 



Appendix B 



In this appendix, we show that V(f ) has a unique global minimum at u = 0. Let u = {A'^\v\'^Pi), 
C ^ ^^(A2|6|2P2), and let 



vr 
4M2 



J] (- [ly^ + e + 2iyCcos{ek{v))]) 



(44) 



, A:=0 



It is sufficient to prove that D{u) has a unique global minimum at = 0. It is straightforward to verify 
that Li/2(— x) is a strictly concave function. Therefore, 



AI-l / M-l 

- ^1/2 (- F + C + 2<cos(0fc(7;))]) < E + + 2Kcos(0fc(^;))] 



(45) 



Af-l 

where we have used the fact that ^ cos( 

A:=0 



+ ^) = 0. Therefore, 



(46) 



For C / 0, the equality holds if and only if = 0. Let F{u) ^ v"^ + C - \ {L,/2 (- [z^^ + C^]))^ it is 
sufficient to show that F{u) has a unique global minimum at v = 0. We have 

d 



dv 



F{v) =2U- UnU/, (- + C']) P(z.2 + ^2)^ 



(47) 



where = ie'^/^ (/o(a;/2) + /i(a;/2)). Since = for i/ = 0, it is sufficient to show that 

> for 7^ 0. In other words, we have to show that f (- [^"^ + C])V{u'^ + C^) < 1- Let 

p = 1/2 + ^2 



TT 



Q{p) = i^L,„{-p)V{p) 
-p 



2 

TT 



(i + P)/o(f)+p/.(f 



(48) 
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we have to show that Q{p) < 1 for p > 0. We will do this by showing that j^Q{p) > and that 



lim Q{p) = 1. We have 



dp 



Qip) 



vr 



(49) 



Let U{p) = pk 



\2 



- ph (f)' - 2/o (f ) h (f ). Hence fO{p) = fe-PU{p). Moreover, lu{p) 



1 



'0 l2;^l ^2. 



dp 



(f ) /i ) > 0. Thus, ?7(p) is strictly increasing for p > 0. Since U (0) — 0, we have that U(^p) > 
for p > 0, which impUes that > for /? > 0. 

It remains to show that lim Q{p) = 1. For large arguments, the Ir{-) has the following asymptotic 

p-^OO 



expansion |29| 



Irix) 



V2 



TTX 



^ e-l (g-l)(g-9) 
8x 2!(8a;)2 



where e = 4r^. We rewrite Q{p) as 



Q{p) 



TT 



(1 + p)Io (I) ' + (1 + 2p)/o (0 /i (0 + Ph 



(50) 



(51) 



Using the expansion in (50 1, we obtain 



lim5e-(l + p)/o(0^=lim Je-Vi(£' ' 



(52) 



and 



(53) 



Therefore, lim Q{p) = 1, which completes the proof. 

p—^OO 



Appendix C 

In this appendix, we prove that V]\j{v) converges uniformly in probability to V{v) when a, b and v 
belong to compact sets. Suppose t; G C. By definition, Vn{v) converges uniformly in probability to V{v) 
when sup^,£(j |VAr(w) — V{v)\ converges in probability to zero as — )• oo. Since yi{v) = Zi{a — v) = 
Zi — A{a — v)tii, Vn{v) is a function of the parameter v, the observations z and known data symbols ti. 
According to Lemma 2.9 in [28, Ch. 36], a sufficient condition for uniform convergence in probability 
when C is compact is the existence of a function TN{z,ti) with bounded expectation E{J^Ar(z, ti)} 
such that for all ^1,1^2 £ C, |VAf(^^i) — ^Af(^'2)| < J^N{z,ti)\vi — ^2!. 
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\Vn{vi) - 



-M-2)|<^E 

i=l 
1 ^ 

=—y 

AT- 1 ^ 

i=l 

X 



1 ^ 1 ^ 

- |yi(^^2)| - ;^ 5Z \yk{vi)\ + j^Yl 
fc=i fe=i 



1 ^ 



1 ^ 

\yi{vi)\ + \yi{v2)\ - 



For i = 1 



and 



We can 



,...,A^, let 



\yi{vi)\ - 



1 ^ 

\yi{v2)\ - J;^^\yk{vl)\- 
k=l 



1 ^ 1 ^ 

j^Y.\ykivi)\ - j^Y^\yk{v2)\ 

k=l k=l 



\yi{vi)\ + [yd 



1 ^ 
fe=i 

use the triangular inequality 



1 ^ 
fe=i 

1 ^ 
fc=i 



' to get 



1 ^ 

Tj(vi,t;2) < lyiCui) - yi{v2)\ + J^Y1 \yki'"i) ■ 

k=l 



-yk{v2)\ 



and 



= 2^Pi|?;i- 



- V2\, 



1 ^ 

+ |y»(^^2)| + ^5]|yfc(t;i)| 
fe=l 

Noting that \yi{'i 



Ai{vi,v2) < \yiivi)\ - 



1 



(«)l < \zi 



(54) 



(55) 



(56) 



(57) 



(58) 



N 

■ j;^^\yk{v2)\. 

k=l 

I + A|a|Pi + ^Pil^l, we further obtain 

2 ^ 

Ai(t'i,'y2) < 2\zi\ + j;^Yl + 4APi|a| + 2^Pi(|^;i| + \v2\). (59) 

k=l 

Since the set C is compact, there exists U > such that 1^1 < W, Vv G C. Hence, we obtain the following 
upper bound on Ai('yi, ^2): 

2 ^ 

Ai(^^i,^^2) < 2|2;i| + j^Yl l^fcl + 2^^il«l + (60) 

fc=i 
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Combining (57 1, and (60 1, we obtain 



\Vn{vi)-Vn{v2) \ < 



8APi 
N-l 



N 



ANPiU + ANPi\a\ + ^ | 



i=l 



\Vi - V2\ 



(61) 



Noting that Zj = yi{a), we have from (43 1 that 



{A\a\Pi + A\h\P2)' 



(62) 



Since both a, and b belong to compact sets, there exists A^i > and 7W2 > such that \a\ < Aii and 
|6| < M2- Hence, 

(63) 



Letting, Q 



1~ 



]E{|^i|} < ^-^{AMiPi + AM2P2) 



Li/2 (-^ {AMiPi + AM2P2f), we obtain 



8^Pi 



N 



N-l 

which completes the proof. 



ANPiU + ANPi\a\ + "^1 



i=l 



(64) 



Appendix D 

In this appendix, we prove Lemma |3] It is obvious from (|24]) that X{v) > with equality if and only 
if the terms {Avtu + Abt2i\, i = 1, . . . , are all equal. Because of the constant modulus nature of the 
data symbols, this occurs at ?; = for any M, which means that there is always a global minimum at 
V = (i.e., at u = a). To see whether we also have X{v) = for some 7^ 0, we rewrite \Avtii + Abt2i\ 
as 

\Avtii + Abt2i\ = \A\v\^/Pie^'^-+^^^ + A\b\^/P2e^^'+*^^\ 
= \A\v\^ie^'^--^'^^'^-'^'' + A\b\^/P2\. 



(65) 



Let Xi = cos((/)t, — 4>b + (f)ii — (f)2i), i = 1, . . . , A^. It is clear from (65 1 that X{v) is zero whenever the 
terms Xi) i = 1, . . . , are all equal. We denote by £ the event that the terms Xi) « = 1, . . . , are 
equal. Let t/ji = 



2i, then the values ipi, i = 1, . . . , N are i.i.d. realizations of the discrete random 

jyj-,^, — 0, ...,M — 1} with equal probability. We 



variable ^ which takes values from the set S'^i = { ^^'^ 



will now show that the event £ occurs if and only if the values ipi, i = 1, . . . , N are chosen from the 
same size 2 subset of Sx^t- Suppose that tpi is fixed, and that we are choosing the remaining phases such 
that cos{ipi + 1?) = cos(V'i + T?) for i = 2, . . . , A^. If is different from ipi for some index k, then 
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cos{'tpK + (l>v — (j^b) = cos('0i + 4>v — (j^h) Can only be satisfied if 2((^„ — <^(,) = (— V'l — V'k)- For the 
remaining phases with indices i = 2, . . . , N, i ^ k, the equaHty cos{il^i + 0^ — <^b) = cos{ipi + <^v — M 
holds only if il^i = ipi or V'j = V'k- Therefore, the terms Xij i = 1, ■ ■ ■ , N are equal if and only if the 
phases ipi, i = 1, . . . ,N take at most two distinct values, i.e, the probability that £ occurs is 

where (^) is the number of distinct subsets of size 2 that can be chosen from a set of size M. Suppose 
that £ occurs and that ipj: is the sum of the two distinct phase values, then X{v) = for all the values of 
V that satisfy 2(^^ — ^{,) = — 'i/'s, which means that there are infinitely many global minimizers of X{v). 
Hence, the probability that X{v) has a unique minimum at = is Pm,n = 1 — [j^]'^ ^ (M — 1). 
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Fig. 2. Average MSB of the DML, GML and MCML algorithms for the estimation of a, plotted versus SNR for M = 2 
(BPSK modulation) and TV = 45. For the MCML algorithm, we use two pilots to estimate 4>b. 




Fig. 3. Average MSB of the DML, GML and MCML algorithms for estimation of a, plotted versus N for M = 2 and an 
SNR of 20 dB. For the MCML algorithm, we use two pilots to estimate 4>b. 
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Fig. 4. Average MSB of the DML and GML algorithms for the estimation of a and the bound CRBa plotted versus SNR for 
M = 4 (QPSK modulation) and A*' = 45. 
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Fig. 5. Average MSB of the DML and GML algorithms for the estimation of o and the bound CRBa plotted versus N for 
M = 4 and an SNR of 20 dB. 



March 5, 2012 



DRAFT 



28 




SNR 



Fig. 6. Average MSB of the DML and GML algorithms for the estimation of |6| and the bound CRB\),\ plotted versus SNR 
for M = 4 and iV = 45. 
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Fig. 7. Average SER versus SNR for the DML and LS estimators for M — 4, assuming the channel is fixed for the duration 
of 20 samples. We use 2 pilots to resolve the M-fold ambiguity in (pt for the DML estimator, and 4 pilots to estimate a and b 
for the LS estimator. Also plotted is the SER performance under perfect CSI. 
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Fig. 8. Average SER versus SNR for the DML and LS estimators for M = 4, assuming the channel is fixed for the duration 
of 40 samples. We use 4 pilots to resolve the M-fold ambiguity in 0;, for the DML estimator, and 8 pilots to estimate a and b 
for the LS estimator. 



March 5, 2012 



DRAFT 



